Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 323 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 32.9 KiB |
| Average record size in memory | 104.4 B |
Variable types
| Numeric | 13 |
|---|---|

| LOAD is highly correlated with AIR FLOW | High correlation |
|---|---|
| AIR FLOW is highly correlated with LOAD | High correlation |
| APH-A-inlet-O2 is highly correlated with APH-B-inlet-O2 and 2 other fields | High correlation |
| APH-B-inlet-O2 is highly correlated with APH-A-inlet-O2 | High correlation |
| APH-A-outlet-O2 is highly correlated with APH-A-inlet-O2 and 1 other field | High correlation |
| APH-B-outlet-O2 is highly correlated with APH-A-inlet-O2 and 1 other field | High correlation |
| APH OUTLET TEMP-PASSA is highly correlated with APH OUTLET TEMP-PASSB | High correlation |
| APH OUTLET TEMP-PASSB is highly correlated with APH OUTLET TEMP-PASSA | High correlation |
| df_index is uniformly distributed | Uniform |
| df_index has unique values | Unique |
| LOAD has unique values | Unique |
| COAL FLOW has unique values | Unique |
| APH-A-inlet-O2 has unique values | Unique |
| APH-B-inlet-O2 has unique values | Unique |
| APH-A-outlet-O2 has unique values | Unique |
| APH-B-outlet-O2 has unique values | Unique |
| APH OUTLET TEMP-PASSA has unique values | Unique |
| APH OUTLET TEMP-PASSB has unique values | Unique |
Reproduction
| Analysis started | 2022-09-21 13:36:15.761222 |
|---|---|
| Analysis finished | 2022-09-21 13:36:31.703680 |
| Duration | 15.94 seconds |
| Software version | pandas-profiling v2.12.0 |
| Download configuration | config.yaml |
df_index
| Distinct | 323 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 162 |
| Minimum | 1 |
| Maximum | 323 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 17.1 |
| Q1 | 81.5 |
| median | 162 |
| Q3 | 242.5 |
| 95-th percentile | 306.9 |
| Maximum | 323 |
| Range | 322 |
| Interquartile range (IQR) | 161 |
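The quantile figures above can be checked by hand, because this variable is simply the integers 1 through 323. The sketch below assumes linear interpolation between neighbouring order statistics; that is an assumption about how the report derives percentiles, not something the report states, and the helper name `quantile` is illustrative.

```python
import math


def quantile(sorted_values, q):
    """Quantile of an already sorted list, using linear interpolation."""
    pos = q * (len(sorted_values) - 1)          # fractional position in the list
    lo = math.floor(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    frac = pos - lo
    return sorted_values[lo] + frac * (sorted_values[hi] - sorted_values[lo])


# The index column: 1, 2, ..., 323 (already sorted).
df_index = list(range(1, 324))

q05 = quantile(df_index, 0.05)
q1 = quantile(df_index, 0.25)
median = quantile(df_index, 0.50)
q3 = quantile(df_index, 0.75)
q95 = quantile(df_index, 0.95)
value_range = df_index[-1] - df_index[0]
iqr = q3 - q1

print(q05, q1, median, q3, q95, value_range, iqr)
```

Under that assumption the results agree with the table up to floating-point rounding: 17.1, 81.5, 162, 242.5 and 306.9, with a range of 322 and an IQR of 161.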
Descriptive statistics
| Standard deviation | 93.3862945 |
|---|---|
| Coefficient of variation (CV) | 0.576458608 |
| Kurtosis | -1.2 |
| Mean | 162 |
| Median Absolute Deviation (MAD) | 81 |
| Skewness | 0 |
| Sum | 52326 |
| Variance | 8721 |
| Monotonicity | Strictly increasing |
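Several of the descriptive statistics above can be reproduced with the Python standard library, since the variable is just the integers 1 through 323. This is a minimal sketch, not the profiler's own code; it assumes the reported variance is the sample variance (denominator n - 1), which is the reading that matches the value 8721.

```python
import statistics

# The index column: 1, 2, ..., 323.
df_index = list(range(1, 324))

total = sum(df_index)
mean = statistics.mean(df_index)
variance = statistics.variance(df_index)   # sample variance, n - 1 denominator
std = statistics.stdev(df_index)           # square root of the sample variance
cv = std / mean                            # coefficient of variation
med = statistics.median(df_index)
# Median absolute deviation: median of the absolute distances to the median.
mad = statistics.median([abs(x - med) for x in df_index])
# Strictly increasing: every value is larger than its predecessor.
strictly_increasing = all(a < b for a, b in zip(df_index, df_index[1:]))

print(total, mean, variance, std, cv, mad, strictly_increasing)
```

Skewness and kurtosis are left out of the sketch because their exact estimator (biased or bias-corrected) is not stated in the report.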
Histogram with fixed size bins (bins=50)

| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1 | 0.3% |
| 243 | 1 | 0.3% |
| 221 | 1 | 0.3% |
| 220 | 1 | 0.3% |
| 219 | 1 | 0.3% |
| 218 | 1 | 0.3% |
| 217 | 1 | 0.3% |
| 216 | 1 | 0.3% |
| 215 | 1 | 0.3% |
| 214 | 1 | 0.3% |
| Other values (313) | 313 | 96.9% |

| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1 | 0.3% |
| 2 | 1 | 0.3% |
| 3 | 1 | 0.3% |
| 4 | 1 | 0.3% |
| 5 | 1 | 0.3% |

| Value | Count | Frequency (%) |
|---|---|---|
| 323 | 1 | 0.3% |
| 322 | 1 | 0.3% |
| 321 | 1 | 0.3% |
| 320 | 1 | 0.3% |
| 319 | 1 | 0.3% |
DUST
Real number (ℝ≥0)
| Distinct | 108 |
|---|---|
| Distinct (%) | 33.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 79.99077399 |
| Minimum | 30 |
| Maximum | 329 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 KiB |
Quantile statistics
| Minimum | 30 |
|---|---|
| 5-th percentile | 36 |
| Q1 | 46 |
| median | 68 |
| Q3 | 102 |
| 95-th percentile | 167.6 |
| Maximum | 329 |
| Range | 299 |
| Interquartile range (IQR) | 56 |
Descriptive statistics
| Standard deviation | 44.3386767 |
|---|---|
| Coefficient of variation (CV) | 0.5542973831 |
| Kurtosis | 5.030811717 |
| Mean | 79.99077399 |
| Median Absolute Deviation (MAD) | 26 |
| Skewness | 1.783696657 |
| Sum | 25837.02 |
| Variance | 1965.918251 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)

| Value | Count | Frequency (%) |
|---|---|---|
| 37 | 15 | 4.6% |
| 38 | 9 | 2.8% |
| 51 | 9 | 2.8% |
| 59 | 9 | 2.8% |
| 34 | 9 | 2.8% |
| 36 | 8 | 2.5% |
| 82 | 8 | 2.5% |
| 40 | 8 | 2.5% |
| 60 | 7 | 2.2% |
| 41 | 7 | 2.2% |
| Other values (98) | 234 | 72.4% |

| Value | Count | Frequency (%) |
|---|---|---|
| 30 | 2 | 0.6% |
| 32 | 1 | 0.3% |
| 33 | 3 | 0.9% |
| 34 | 9 | 2.8% |
| 36 | 8 | 2.5% |

| Value | Count | Frequency (%) |
|---|---|---|
| 329 | 1 | 0.3% |
| 299 | 1 | 0.3% |
| 252 | 1 | 0.3% |
| 203 | 1 | 0.3% |
| 202 | 4 | 1.2% |
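The Frequency (%) column in these value tables is the count divided by the 323 observations. The sketch below recomputes it for the ten most common DUST values listed above; the names `common_value_counts` and `frequency_percent` are illustrative, and rounding to one decimal place is an assumption that matches the displayed figures.

```python
# Number of observations reported in the dataset statistics.
n_observations = 323

# Ten most common DUST values and their counts, copied from the table above.
common_value_counts = {
    37: 15, 38: 9, 51: 9, 59: 9, 34: 9,
    36: 8, 82: 8, 40: 8, 60: 7, 41: 7,
}


def frequency_percent(count, n):
    """Share of observations, as a percentage rounded to one decimal place."""
    return round(100 * count / n, 1)


frequencies = {
    value: frequency_percent(count, n_observations)
    for value, count in common_value_counts.items()
}

# Whatever is not in the top ten falls into the "Other values" row.
other_count = n_observations - sum(common_value_counts.values())
other_frequency = frequency_percent(other_count, n_observations)

print(frequencies)
print(other_count, other_frequency)
```

The remainder comes to 234 observations, which matches the count shown in the "Other values (98)" row.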
LOAD
| Distinct | 323 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 249.1974401 |
| Minimum | 179.7337494 |
| Maximum | 292.5237427 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 KiB |
Quantile statistics
| Minimum | 179.7337494 |
|---|---|
| 5-th percentile | 191.0587097 |
| Q1 | 235.9279251 |
| median | 262.0897217 |
| Q3 | 269.5638428 |
| 95-th percentile | 282.1558716 |
| Maximum | 292.5237427 |
| Range | 112.7899933 |
| Interquartile range (IQR) | 33.63591766 |
Descriptive statistics
| Standard deviation | 27.49003851 |
|---|---|
| Coefficient of variation (CV) | 0.1103142893 |
| Kurtosis | -0.2015756694 |
| Mean | 249.1974401 |
| Median Absolute Deviation (MAD) | 14.58752441 |
| Skewness | -0.8524820081 |
| Sum | 80490.77315 |
| Variance | 755.7022172 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)

| Value | Count | Frequency (%) |
|---|---|---|
| 219.169281 | 1 | 0.3% |
| 210.4389496 | 1 | 0.3% |
| 287.0630798 | 1 | 0.3% |
| 286.9558716 | 1 | 0.3% |
| 285.2581787 | 1 | 0.3% |
| 280.194458 | 1 | 0.3% |
| 266.619812 | 1 | 0.3% |
| 265.3027039 | 1 | 0.3% |
| 265.5824585 | 1 | 0.3% |
| 265.006897 | 1 | 0.3% |
| Other values (313) | 313 | 96.9% |

| Value | Count | Frequency (%) |
|---|---|---|
| 179.7337494 | 1 | 0.3% |
| 179.8209839 | 1 | 0.3% |
| 179.858902 | 1 | 0.3% |
| 179.9351196 | 1 | 0.3% |
| 180.1061707 | 1 | 0.3% |

| Value | Count | Frequency (%) |
|---|---|---|
| 292.5237427 | 1 | 0.3% |
| 290.6046753 | 1 | 0.3% |
| 290.4057007 | 1 | 0.3% |
| 290.3713074 | 1 | 0.3% |
| 288.2381897 | 1 | 0.3% |
SOX
Real number (ℝ≥0)
| Distinct | 254 |
|---|---|
| Distinct (%) | 78.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1833.30031 |
| Minimum | 10 |
| Maximum | 2482 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 1541.1 |
| Q1 | 1651 |
| median | 1804 |
| Q3 | 2094 |
| 95-th percentile | 2336.9 |
| Maximum | 2482 |
| Range | 2472 |
| Interquartile range (IQR) | 443 |
Descriptive statistics
| Standard deviation | 381.4239253 |
|---|---|
| Coefficient of variation (CV) | 0.2080531614 |
| Kurtosis | 10.67276954 |
| Mean | 1833.30031 |
| Median Absolute Deviation (MAD) | 195 |
| Skewness | -2.433229497 |
| Sum | 592156 |
| Variance | 145484.2108 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)

| Value | Count | Frequency (%) |
|---|---|---|
| 1602 | 13 | 4.0% |
| 10 | 6 | 1.9% |
| 1798 | 5 | 1.5% |
| 1781 | 3 | 0.9% |
| 1746 | 3 | 0.9% |
| 2109 | 3 | 0.9% |
| 2098 | 3 | 0.9% |
| 1792 | 3 | 0.9% |
| 1571 | 2 | 0.6% |
| 2310 | 2 | 0.6% |
| Other values (244) | 280 | 86.7% |

| Value | Count | Frequency (%) |
|---|---|---|
| 10 | 6 | 1.9% |
| 11 | 2 | 0.6% |
| 1495 | 1 | 0.3% |
| 1525 | 1 | 0.3% |
| 1529 | 1 | 0.3% |

| Value | Count | Frequency (%) |
|---|---|---|
| 2482 | 1 | 0.3% |
| 2475 | 1 | 0.3% |
| 2449 | 1 | 0.3% |
| 2414 | 1 | 0.3% |
| 2397 | 1 | 0.3% |
COAL FLOW
| Distinct | 323 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 190.0611782 |
| Minimum | 130.7405396 |
| Maximum | 226.2332153 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 KiB |
Quantile statistics
| Minimum | 130.7405396 |
|---|---|
| 5-th percentile | 142.1273544 |
| Q1 | 178.2989807 |
| median | 192.4747162 |
| Q3 | 205.2574158 |
| 95-th percentile | 224.462825 |
| Maximum | 226.2332153 |
| Range | 95.49267578 |
| Interquartile range (IQR) | 26.95843506 |
Descriptive statistics
| Standard deviation | 22.25605299 |
|---|---|
| Coefficient of variation (CV) | 0.1170994161 |
| Kurtosis | -0.01180940084 |
| Mean | 190.0611782 |
| Median Absolute Deviation (MAD) | 13.86590576 |
| Skewness | -0.6001190738 |
| Sum | 61389.76056 |
| Variance | 495.3318945 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)

| Value | Count | Frequency (%) |
|---|---|---|
| 160.2473755 | 1 | 0.3% |
| 175.8517303 | 1 | 0.3% |
| 224.6723785 | 1 | 0.3% |
| 224.8042908 | 1 | 0.3% |
| 222.2261353 | 1 | 0.3% |
| 216.4401703 | 1 | 0.3% |
| 216.5605469 | 1 | 0.3% |
| 206.3406219 | 1 | 0.3% |
| 203.2284241 | 1 | 0.3% |
| 199.2761383 | 1 | 0.3% |
| Other values (313) | 313 | 96.9% |

| Value | Count | Frequency (%) |
|---|---|---|
| 130.7405396 | 1 | 0.3% |
| 131.7583466 | 1 | 0.3% |
| 133.6048889 | 1 | 0.3% |
| 133.833374 | 1 | 0.3% |
| 134.2231598 | 1 | 0.3% |

| Value | Count | Frequency (%) |
|---|---|---|
| 226.2332153 | 1 | 0.3% |
| 226.0489197 | 1 | 0.3% |
| 225.3103638 | 1 | 0.3% |
| 225.046463 | 1 | 0.3% |
| 224.9169922 | 1 | 0.3% |
AIR FLOW
| Distinct | 151 |
|---|---|
| Distinct (%) | 46.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 935.3343653 |
| Minimum | 764 |
| Maximum | 1086 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 KiB |
Quantile statistics
| Minimum | 764 |
|---|---|
| 5-th percentile | 770.1 |
| Q1 | 856 |
| median | 966 |
| Q3 | 1004.5 |
| 95-th percentile | 1052.4 |
| Maximum | 1086 |
| Range | 322 |
| Interquartile range (IQR) | 148.5 |
Descriptive statistics
| Standard deviation | 86.63563108 |
|---|---|
| Coefficient of variation (CV) | 0.09262530523 |
| Kurtosis | -1.080989847 |
| Mean | 935.3343653 |
| Median Absolute Deviation (MAD) | 57 |
| Skewness | -0.4401245278 |
| Sum | 302113 |
| Variance | 7505.732573 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)

| Value | Count | Frequency (%) |
|---|---|---|
| 768 | 8 | 2.5% |
| 857 | 7 | 2.2% |
| 989 | 7 | 2.2% |
| 978 | 6 | 1.9% |
| 948 | 6 | 1.9% |
| 1014 | 6 | 1.9% |
| 856 | 5 | 1.5% |
| 858 | 5 | 1.5% |
| 977 | 5 | 1.5% |
| 988 | 5 | 1.5% |
| Other values (141) | 263 | 81.4% |

| Value | Count | Frequency (%) |
|---|---|---|
| 764 | 1 | 0.3% |
| 767 | 3 | 0.9% |
| 768 | 8 | 2.5% |
| 769 | 4 | 1.2% |
| 770 | 1 | 0.3% |

| Value | Count | Frequency (%) |
|---|---|---|
| 1086 | 1 | 0.3% |
| 1067 | 1 | 0.3% |
| 1066 | 1 | 0.3% |
| 1065 | 1 | 0.3% |
| 1064 | 2 | 0.6% |
APH-A-inlet-O2
| Distinct | 323 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.124628761 |
| Minimum | 1.775512934 |
| Maximum | 6.191920757 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 KiB |
Quantile statistics
| Minimum | 1.775512934 |
|---|---|
| 5-th percentile | 2.111430192 |
| Q1 | 2.511747837 |
| median | 2.988306761 |
| Q3 | 3.490730524 |
| 95-th percentile | 4.811655188 |
| Maximum | 6.191920757 |
| Range | 4.416407824 |
| Interquartile range (IQR) | 0.978982687 |
Descriptive statistics
| Standard deviation | 0.8309047984 |
|---|---|
| Coefficient of variation (CV) | 0.265921126 |
| Kurtosis | 1.129782244 |
| Mean | 3.124628761 |
| Median Absolute Deviation (MAD) | 0.4914426804 |
| Skewness | 1.094738012 |
| Sum | 1009.25509 |
| Variance | 0.6904027841 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)

| Value | Count | Frequency (%) |
|---|---|---|
| 2.645081758 | 1 | 0.3% |
| 3.989323139 | 1 | 0.3% |
| 2.535947561 | 1 | 0.3% |
| 2.485507727 | 1 | 0.3% |
| 2.285168886 | 1 | 0.3% |
| 2.267483711 | 1 | 0.3% |
| 2.810749292 | 1 | 0.3% |
| 3.280973196 | 1 | 0.3% |
| 3.069856644 | 1 | 0.3% |
| 3.384076595 | 1 | 0.3% |
| Other values (313) | 313 | 96.9% |

| Value | Count | Frequency (%) |
|---|---|---|
| 1.775512934 | 1 | 0.3% |
| 1.814382911 | 1 | 0.3% |
| 1.85340929 | 1 | 0.3% |
| 1.898377299 | 1 | 0.3% |
| 1.944901109 | 1 | 0.3% |

| Value | Count | Frequency (%) |
|---|---|---|
| 6.191920757 | 1 | 0.3% |
| 6.16395998 | 1 | 0.3% |
| 5.692596436 | 1 | 0.3% |
| 5.581057549 | 1 | 0.3% |
| 5.547461987 | 1 | 0.3% |
APH-B-inlet-O2
| Distinct | 323 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.012168919 |
| Minimum | 1.698701382 |
| Maximum | 5.354305744 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 KiB |
Quantile statistics
| Minimum | 1.698701382 |
|---|---|
| 5-th percentile | 2.104239321 |
| Q1 | 2.489782929 |
| median | 2.909387112 |
| Q3 | 3.367511749 |
| 95-th percentile | 4.412256718 |
| Maximum | 5.354305744 |
| Range | 3.655604362 |
| Interquartile range (IQR) | 0.8777288198 |
Descriptive statistics
| Standard deviation | 0.7047507468 |
|---|---|
| Coefficient of variation (CV) | 0.2339678702 |
| Kurtosis | 0.3767929664 |
| Mean | 3.012168919 |
| Median Absolute Deviation (MAD) | 0.4328315258 |
| Skewness | 0.8415969742 |
| Sum | 972.9305608 |
| Variance | 0.496673615 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)

| Value | Count | Frequency (%) |
|---|---|---|
| 2.914422512 | 1 | 0.3% |
| 3.667276144 | 1 | 0.3% |
| 2.838121176 | 1 | 0.3% |
| 2.895278454 | 1 | 0.3% |
| 2.557571411 | 1 | 0.3% |
| 2.621006966 | 1 | 0.3% |
| 2.902625322 | 1 | 0.3% |
| 3.32805872 | 1 | 0.3% |
| 2.997571945 | 1 | 0.3% |
| 3.193569183 | 1 | 0.3% |
| Other values (313) | 313 | 96.9% |

| Value | Count | Frequency (%) |
|---|---|---|
| 1.698701382 | 1 | 0.3% |
| 1.703244209 | 1 | 0.3% |
| 1.785511613 | 1 | 0.3% |
| 1.925692677 | 1 | 0.3% |
| 1.941848159 | 1 | 0.3% |

| Value | Count | Frequency (%) |
|---|---|---|
| 5.354305744 | 1 | 0.3% |
| 5.324774265 | 1 | 0.3% |
| 5.001795769 | 1 | 0.3% |
| 4.952135086 | 1 | 0.3% |
| 4.886171818 | 1 | 0.3% |
APH-A-outlet-O2
| Distinct | 323 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.267019702 |
| Minimum | 2.925982952 |
| Maximum | 7.331449509 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 KiB |
Quantile statistics
| Minimum | 2.925982952 |
|---|---|
| 5-th percentile | 3.223829222 |
| Q1 | 3.704240203 |
| median | 4.144909859 |
| Q3 | 4.64294076 |
| 95-th percentile | 5.9407269 |
| Maximum | 7.331449509 |
| Range | 4.405466557 |
| Interquartile range (IQR) | 0.9387005568 |
Descriptive statistics
| Standard deviation | 0.8074154325 |
|---|---|
| Coefficient of variation (CV) | 0.1892223352 |
| Kurtosis | 1.048130837 |
| Mean | 4.267019702 |
| Median Absolute Deviation (MAD) | 0.4696316719 |
| Skewness | 1.007382697 |
| Sum | 1378.247364 |
| Variance | 0.6519196806 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)

| Value | Count | Frequency (%) |
|---|---|---|
| 3.774693012 | 1 | 0.3% |
| 4.993573666 | 1 | 0.3% |
| 3.417258739 | 1 | 0.3% |
| 3.310585022 | 1 | 0.3% |
| 3.092291117 | 1 | 0.3% |
| 3.220851421 | 1 | 0.3% |
| 3.623283386 | 1 | 0.3% |
| 4.094760418 | 1 | 0.3% |
| 3.924095631 | 1 | 0.3% |
| 4.245200634 | 1 | 0.3% |
| Other values (313) | 313 | 96.9% |

| Value | Count | Frequency (%) |
|---|---|---|
| 2.925982952 | 1 | 0.3% |
| 2.926098347 | 1 | 0.3% |
| 2.943948269 | 1 | 0.3% |
| 2.968452454 | 1 | 0.3% |
| 2.994398594 | 1 | 0.3% |

| Value | Count | Frequency (%) |
|---|---|---|
| 7.331449509 | 1 | 0.3% |
| 7.222032547 | 1 | 0.3% |
| 6.696996212 | 1 | 0.3% |
| 6.626823902 | 1 | 0.3% |
| 6.568151474 | 1 | 0.3% |
APH-B-outlet-O2
| Distinct | 323 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.267019702 |
| Minimum | 2.925982952 |
| Maximum | 7.331449509 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 KiB |
Quantile statistics
| Minimum | 2.925982952 |
|---|---|
| 5-th percentile | 3.223829222 |
| Q1 | 3.704240203 |
| median | 4.144909859 |
| Q3 | 4.64294076 |
| 95-th percentile | 5.9407269 |
| Maximum | 7.331449509 |
| Range | 4.405466557 |
| Interquartile range (IQR) | 0.9387005568 |
Descriptive statistics
| Standard deviation | 0.8074154325 |
|---|---|
| Coefficient of variation (CV) | 0.1892223352 |
| Kurtosis | 1.048130837 |
| Mean | 4.267019702 |
| Median Absolute Deviation (MAD) | 0.4696316719 |
| Skewness | 1.007382697 |
| Sum | 1378.247364 |
| Variance | 0.6519196806 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)

| Value | Count | Frequency (%) |
|---|---|---|
| 3.774693012 | 1 | 0.3% |
| 4.993573666 | 1 | 0.3% |
| 3.417258739 | 1 | 0.3% |
| 3.310585022 | 1 | 0.3% |
| 3.092291117 | 1 | 0.3% |
| 3.220851421 | 1 | 0.3% |
| 3.623283386 | 1 | 0.3% |
| 4.094760418 | 1 | 0.3% |
| 3.924095631 | 1 | 0.3% |
| 4.245200634 | 1 | 0.3% |
| Other values (313) | 313 | 96.9% |

| Value | Count | Frequency (%) |
|---|---|---|
| 2.925982952 | 1 | 0.3% |
| 2.926098347 | 1 | 0.3% |
| 2.943948269 | 1 | 0.3% |
| 2.968452454 | 1 | 0.3% |
| 2.994398594 | 1 | 0.3% |

| Value | Count | Frequency (%) |
|---|---|---|
| 7.331449509 | 1 | 0.3% |
| 7.222032547 | 1 | 0.3% |
| 6.696996212 | 1 | 0.3% |
| 6.626823902 | 1 | 0.3% |
| 6.568151474 | 1 | 0.3% |
NOX
Real number (ℝ≥0)
| Distinct | 220 |
|---|---|
| Distinct (%) | 68.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 645.1145511 |
| Minimum | 5 |
| Maximum | 989 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 445.1 |
| Q1 | 560 |
| median | 630 |
| Q3 | 764 |
| 95-th percentile | 874 |
| Maximum | 989 |
| Range | 984 |
| Interquartile range (IQR) | 204 |
Descriptive statistics
| Standard deviation | 165.5003027 |
|---|---|
| Coefficient of variation (CV) | 0.2565440547 |
| Kurtosis | 3.534633679 |
| Mean | 645.1145511 |
| Median Absolute Deviation (MAD) | 104 |
| Skewness | -1.184293198 |
| Sum | 208372 |
| Variance | 27390.35019 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)

| Value | Count | Frequency (%) |
|---|---|---|
| 620 | 12 | 3.7% |
| 5 | 8 | 2.5% |
| 579 | 5 | 1.5% |
| 861 | 4 | 1.2% |
| 630 | 4 | 1.2% |
| 811 | 4 | 1.2% |
| 795 | 3 | 0.9% |
| 560 | 3 | 0.9% |
| 572 | 3 | 0.9% |
| 606 | 3 | 0.9% |
| Other values (210) | 274 | 84.8% |

| Value | Count | Frequency (%) |
|---|---|---|
| 5 | 8 | 2.5% |
| 411 | 1 | 0.3% |
| 426 | 1 | 0.3% |
| 427 | 1 | 0.3% |
| 430 | 2 | 0.6% |

| Value | Count | Frequency (%) |
|---|---|---|
| 989 | 1 | 0.3% |
| 962 | 1 | 0.3% |
| 941 | 1 | 0.3% |
| 922 | 1 | 0.3% |
| 920 | 1 | 0.3% |
APH OUTLET TEMP-PASSA
| Distinct | 323 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 132.4925833 |
| Minimum | 123.6230748 |
| Maximum | 147.1204224 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 KiB |
Quantile statistics
| Minimum | 123.6230748 |
|---|---|
| 5-th percentile | 126.4946259 |
| Q1 | 128.8385773 |
| median | 131.194163 |
| Q3 | 135.6556422 |
| 95-th percentile | 141.9840373 |
| Maximum | 147.1204224 |
| Range | 23.49734751 |
| Interquartile range (IQR) | 6.817064921 |
Descriptive statistics
| Standard deviation | 4.864357316 |
|---|---|
| Coefficient of variation (CV) | 0.03671418577 |
| Kurtosis | 0.3675239353 |
| Mean | 132.4925833 |
| Median Absolute Deviation (MAD) | 2.81775411 |
| Skewness | 0.8692218146 |
| Sum | 42795.10441 |
| Variance | 23.6619721 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)

| Value | Count | Frequency (%) |
|---|---|---|
| 128.8274587 | 1 | 0.3% |
| 123.7879257 | 1 | 0.3% |
| 138.3758341 | 1 | 0.3% |
| 138.1249339 | 1 | 0.3% |
| 137.9066264 | 1 | 0.3% |
| 137.1219126 | 1 | 0.3% |
| 136.874644 | 1 | 0.3% |
| 136.1736603 | 1 | 0.3% |
| 136.0526021 | 1 | 0.3% |
| 134.6970011 | 1 | 0.3% |
| Other values (313) | 313 | 96.9% |

| Value | Count | Frequency (%) |
|---|---|---|
| 123.6230748 | 1 | 0.3% |
| 123.7879257 | 1 | 0.3% |
| 124.5887451 | 1 | 0.3% |
| 124.601476 | 1 | 0.3% |
| 124.8307521 | 1 | 0.3% |

| Value | Count | Frequency (%) |
|---|---|---|
| 147.1204224 | 1 | 0.3% |
| 147.0744019 | 1 | 0.3% |
| 146.6251068 | 1 | 0.3% |
| 146.5697581 | 1 | 0.3% |
| 146.4863485 | 1 | 0.3% |
APH OUTLET TEMP-PASSB
| Distinct | 323 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 135.8611184 |
| Minimum | 126.9053472 |
| Maximum | 150.4699198 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.6 KiB |
Quantile statistics
| Minimum | 126.9053472 |
|---|---|
| 5-th percentile | 130.5052981 |
| Q1 | 132.3298518 |
| median | 134.3119303 |
| Q3 | 139.0307592 |
| 95-th percentile | 145.0121429 |
| Maximum | 150.4699198 |
| Range | 23.56457265 |
| Interquartile range (IQR) | 6.700907389 |
Descriptive statistics
| Standard deviation | 4.834372151 |
|---|---|
| Coefficient of variation (CV) | 0.03558319117 |
| Kurtosis | 0.366054226 |
| Mean | 135.8611184 |
| Median Absolute Deviation (MAD) | 2.667055766 |
| Skewness | 0.9263984428 |
| Sum | 43883.14126 |
| Variance | 23.37115409 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)

| Value | Count | Frequency (%) |
|---|---|---|
| 131.4184875 | 1 | 0.3% |
| 126.9053472 | 1 | 0.3% |
| 141.9561412 | 1 | 0.3% |
| 141.7056173 | 1 | 0.3% |
| 141.472407 | 1 | 0.3% |
| 140.7670492 | 1 | 0.3% |
| 140.5693766 | 1 | 0.3% |
| 139.6246847 | 1 | 0.3% |
| 139.5046438 | 1 | 0.3% |
| 138.5139211 | 1 | 0.3% |
| Other values (313) | 313 | 96.9% |

| Value | Count | Frequency (%) |
|---|---|---|
| 126.9053472 | 1 | 0.3% |
| 127.011289 | 1 | 0.3% |
| 127.767334 | 1 | 0.3% |
| 127.8590597 | 1 | 0.3% |
| 128.0790431 | 1 | 0.3% |

| Value | Count | Frequency (%) |
|---|---|---|
| 150.4699198 | 1 | 0.3% |
| 150.3160706 | 1 | 0.3% |
| 149.9199829 | 1 | 0.3% |
| 149.895401 | 1 | 0.3% |
| 149.8172201 | 1 | 0.3% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
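As a rough illustration of that calculation, the sketch below computes r in plain Python from the covariance and the two standard deviations. The function name `pearson_r` and the toy data are hypothetical, not taken from the report.

```python
import math


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/n (or 1/(n - 1)) factors cancel between numerator and denominator,
    # so plain sums of products and squares are sufficient.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(ss_x * ss_y)


xs = [1, 2, 3, 4, 5]
rising = [2 * x + 1 for x in xs]        # exact increasing line
falling = [10 - 3 * x for x in xs]      # exact decreasing line
shifted = [100 + 5 * x for x in xs]     # xs after a change of location and scale

r_rising = pearson_r(xs, rising)
r_falling = pearson_r(xs, falling)
r_shifted = pearson_r(shifted, rising)
print(r_rising, r_falling, r_shifted)
```

The two exact lines give values of about +1 and -1 regardless of their slope, and rescaling `xs` with a positive factor leaves r unchanged, which is the invariance described above.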
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
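A minimal sketch of that recipe follows: convert both variables to ranks, then apply the Pearson formula to the ranks. Giving tied values the average of their ranks is an assumption of this sketch, and the helper names and toy data are hypothetical.

```python
import math


def pearson_r(xs, ys):
    """Covariance divided by the product of the standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(ss_x * ss_y)


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of equal values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(xs, ys):
    """Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


xs = [1, 2, 3, 4, 5]
cubes = [x ** 3 for x in xs]            # monotonic but not linear

rho = spearman_rho(xs, cubes)
r = pearson_r(xs, cubes)
tie_ranks = average_ranks([10, 20, 20, 30])
print(rho, r, tie_ranks)
```

For the cubic relationship ρ is 1 up to rounding while r stays below 1, which illustrates why ρ is better suited to nonlinear monotonic relationships.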
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
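The pair-counting definition can be sketched directly. The version below is the simple form described above, without any correction for ties (often called tau-a); whether the report itself applies a tie correction is not stated here, and the function name and toy data are hypothetical.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs, with no tie correction."""
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(list(zip(xs, ys)), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:        # both variables move the same way
            concordant += 1
        elif direction < 0:      # the variables move in opposite ways
            discordant += 1
        # direction == 0 means a tie in x or y; it counts for neither side
    n = len(xs)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


xs = [1, 2, 3, 4]
tau_same = kendall_tau_a(xs, [1, 2, 3, 4])      # identical ordering
tau_reversed = kendall_tau_a(xs, [4, 3, 2, 1])  # fully reversed ordering
tau_swapped = kendall_tau_a(xs, [1, 3, 2, 4])   # one swapped neighbour pair
print(tau_same, tau_reversed, tau_swapped)
```

With four observations there are six pairs. Swapping one neighbouring pair leaves five concordant and one discordant, so τ is (5 - 1) / 6.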
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
First rows
|  | df_index | DUST | LOAD | SOX | COAL FLOW | AIR FLOW | APH-A-inlet-O2 | APH-B-inlet-O2 | APH-A-outlet-O2 | APH-B-outlet-O2 | NOX | APH OUTLET TEMP-PASSA | APH OUTLET TEMP-PASSB |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 90.80 | 219.169281 | 2058.0 | 160.247375 | 771.0 | 2.645082 | 2.914423 | 3.774693 | 3.774693 | 669.0 | 128.827459 | 131.418488 |
| 1 | 2 | 44.00 | 193.337067 | 1905.0 | 139.956100 | 782.0 | 4.643019 | 4.294496 | 5.754969 | 5.754969 | 782.0 | 129.108368 | 132.330617 |
| 2 | 3 | 44.90 | 191.797440 | 1866.0 | 139.279434 | 773.0 | 4.615278 | 4.364353 | 5.611253 | 5.611253 | 758.0 | 132.004634 | 134.760106 |
| 3 | 4 | 40.12 | 191.705933 | 1870.0 | 140.341476 | 768.0 | 4.522539 | 4.274601 | 5.575839 | 5.575839 | 740.0 | 131.639511 | 134.136602 |
| 4 | 5 | 40.00 | 190.005066 | 1887.0 | 141.173676 | 768.0 | 4.230556 | 4.113577 | 5.336401 | 5.336401 | 754.0 | 131.172384 | 133.700633 |
| 5 | 6 | 41.00 | 189.752640 | 1806.0 | 140.826996 | 768.0 | 4.524929 | 4.193811 | 5.585227 | 5.585227 | 724.0 | 130.728358 | 133.301366 |
| 6 | 7 | 40.00 | 193.009094 | 1824.0 | 143.815170 | 771.0 | 4.227230 | 3.960650 | 5.290730 | 5.290730 | 706.0 | 130.654622 | 133.195791 |
| 7 | 8 | 41.00 | 191.858978 | 1854.0 | 140.718246 | 768.0 | 4.044431 | 4.127516 | 5.160898 | 5.160898 | 735.0 | 130.117528 | 132.558395 |
| 8 | 9 | 41.00 | 191.104523 | 1865.0 | 141.939819 | 767.0 | 4.361370 | 4.081351 | 5.404026 | 5.404026 | 719.0 | 129.291687 | 131.804967 |
| 9 | 10 | 44.20 | 191.053619 | 1865.0 | 140.984940 | 768.0 | 4.212238 | 4.090389 | 5.256097 | 5.256097 | 743.0 | 128.896029 | 131.466138 |
Last rows
|  | df_index | DUST | LOAD | SOX | COAL FLOW | AIR FLOW | APH-A-inlet-O2 | APH-B-inlet-O2 | APH-A-outlet-O2 | APH-B-outlet-O2 | NOX | APH OUTLET TEMP-PASSA | APH OUTLET TEMP-PASSB |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 313 | 314 | 180.0 | 275.933197 | 1602.0 | 200.920395 | 1060.0 | 2.730782 | 3.003725 | 3.795549 | 3.795549 | 620.0 | 141.814143 | 145.620738 |
| 314 | 315 | 183.0 | 290.405701 | 1602.0 | 225.310364 | 1066.0 | 2.007309 | 2.099038 | 3.068498 | 3.068498 | 620.0 | 146.202092 | 149.520752 |
| 315 | 316 | 190.0 | 292.523743 | 1602.0 | 224.609283 | 1064.0 | 2.086578 | 2.106927 | 3.150498 | 3.150498 | 620.0 | 146.569758 | 149.919983 |
| 316 | 317 | 202.0 | 290.604675 | 1602.0 | 224.645859 | 1067.0 | 2.463638 | 2.223314 | 3.568580 | 3.568580 | 620.0 | 146.486348 | 149.895401 |
| 317 | 318 | 202.0 | 284.613831 | 1602.0 | 224.514771 | 1062.0 | 2.822672 | 2.247865 | 3.869678 | 3.869678 | 620.0 | 147.074402 | 150.469920 |
| 318 | 319 | 202.0 | 280.991913 | 1602.0 | 224.756393 | 1058.0 | 2.609954 | 2.855597 | 3.706730 | 3.706730 | 620.0 | 147.120422 | 150.316071 |
| 319 | 320 | 202.0 | 281.414917 | 1602.0 | 224.468277 | 1065.0 | 2.553551 | 2.744499 | 3.681642 | 3.681642 | 620.0 | 146.625107 | 149.817220 |
| 320 | 321 | 252.0 | 275.979797 | 1602.0 | 225.046463 | 1053.0 | 2.936583 | 2.922196 | 3.923084 | 3.923084 | 620.0 | 146.369191 | 149.502660 |
| 321 | 322 | 299.0 | 276.770416 | 1602.0 | 224.916992 | 1047.0 | 2.883361 | 2.909387 | 4.035063 | 4.035063 | 620.0 | 146.193136 | 149.266301 |
| 322 | 323 | 193.0 | 279.781921 | 1602.0 | 224.502472 | 1055.0 | 2.843870 | 3.033844 | 3.755615 | 3.755615 | 620.0 | 142.019068 | 144.889893 |